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Abstract — We solve the problem concern- 
ing a time optimal return of a particle with 
a prescribed velocity to the origin by apply- 
ing a magnitude-bounded force. The equa- 
tions of controlled motion are derived and 
explicitly integrated, and the optimal open- 
loop control and optimal time are analyzed 
depending on the parameters of the problem. 
The qualitative behavior of the solution is es- 
tablished, and the solution is compared with 
other regimes of motion. The results are of 
interest for control theory and its applications 
to controlled flight mechanics. 
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I. Introduction 

In the case of general initial and final conditions 
(synthesis), the solution to a multidimensional prob- 
lem faces considerable computational difficulties. Al- 
though the equations of controlled motion can be an- 
alytically integrated in explicit form, the maximum- 
principle boundary value problem can be solved nu- 
merically only for modified special formulations in 
which the final velocity is ignored [1-3]. A numerical- 
analytical study with fixed magnitudes of velocity 
was performed for equal values of the initial and final 
velocities [4]. 

A theoretically and practically interesting case is 
that where the initial and final positions coincide 
while the corresponding velocities differ. It leads to 
planar (two-dimensional) motion for nonparallel ve- 
locity vectors. This plane of controlled optimal mo- 
tion is determined by the indicated vectors. In the 
special case of parallel vectors, the problem degener- 
ates into the one-dimensional case, for which an ana- 
lytical solution can easily be constructed. In the case 
of two-dimensional motion, the maximum principle 
can be used to determine the structure of an opti- 
mal control and to completely integrate the equations 
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of motion. The unknown parameters and the op- 
timal time are numerically determined by Newton's 
method from the maximum-principle boundary con- 
ditions an arbitrary magnitude and direction of the 
initial velocity. Control regimes with coinciding ini- 
tial and final positions (or velocities) can be used to 
construct quasi-optimal global motion. 

II. Formulation of the problem 

The control system under consideration, the ter- 
minal conditions, and the functional are governed by 
the equations 

x = v, x(0) = xq, x(tf)=0 (1) 

v = u, v(0) = v , v(tf) = Vf = (-1, 0) 
M < 1, t f — > min x, v £ R 2 

u 

The more general case of arbitrary values specified 
for the constant mass m, the control force u con- 
strained by | it | < uq, the terminal velocity Vf, the 
coinciding initial xq and final Xf positions, and the 
initial time to can be reduced to problem (1) by us- 
ing suitable substitutions. System (1) is rotationally 
symmetric. In the general case where the vectors x 
and v have a dimension n > 2, the optimal control 
regime is equivalent to two-dimensional motion ([4,5]. 

The control problem has a solution in the class of 
piecewise continuous functions. Necessary optimal- 
ity conditions of the maximum principle type [1] are 
applicable to this problem. 

The solution of time optimal control problem (1) is 
reduced to solving a two-point boundary value prob- 
lem for a control function of the form 

u*=q\q\-\ p(t) = p° = const, (2) 

q(t) = -p°t + q , q° = const, 

where p and q are the variables conjugate to x and v 
(momenta) . 

Substituting the control function u*(t) into the 
equations of motion and performing single and re- 
peated integrations, we obtain representations for the 
velocity v(t) and the position vector x(t), respectively 
[4, 5]. 



As a result, we find explicit representations of the 
desired phase variables x(t) and v(t) in terms of alge- 
braic and logarithmic functions. They have the form 
of "linear" expressions in terms of vectors £ and r\: 

v(t)=vo+ f u*{T)dT = v + V i {t)Z + V T ,(t)ri (3) 
Jo 

x(t) = x Q + v t + Xs(t)£ + X v (t)r) 
u* = Q, Q = -£t + v 

R=\Q\ = ( P 2 t 2 -2apt + l) 1 ' 2 

Here, £ and r\ are the two-dimensional vectors po 
and go normalized by \qo\ > and r\ is the unit vector. 
Thus, p = |£|, pa = (£,7?), and the 2x2 matrix 
(V£ i?? , X^ tV ) is nonsingular in the generic case. The 
scalar functions V^ iV and X^, v depend on time t and 
the parameters p and a in a rather intricate manner: 

Vi = - P - 2 {oV + R)\l V v = p-'Vll (4) 
X 6 = 7^[(-pr + 3a)R + (-2par + 3a 2 - 1)V]* 
+^(1 + ^(0)) 

X ri = p- 2 [-R+(pr-a)V} t --V(0) 

The quantities to be determined are the unknown 
vectors £ and r\ and the optimal time tf. 

III. Solution of the problem 

The boundary value problem is reduced to a sys- 
tem of four algebraic equations for £, if, and tf ob- 
tained from (3) and (4) taking into account boundary 
conditions (1). Instead of £, we introduce the vector 
C = £tf, where \(\ = p, = ptf. This makes it pos- 
sible to separate the unknown tf in the equations. 
The system of two vector equations for £, 77 and tf is 
transformed into 

Vf =t f (a ( C + a v ri) (5) 

v -Vf = tfib^C + b v v) 

The scalar coefficients v and n in (5) are func- 
tions of only p and a: 

a = (p 2 - 2pa+ 1) 1/2 - 1, (6) 

b = ln((p -a+(p 2 - 2pa + l) 1/2 )/(l - a)) 
a c = -(l/2p- 3 )[(p + 3a)a + p + (3a 2 - 1)6] 
a v = b{ = p~ 2 (a + ab), b n = —b/p 



By reducing (5) to three scalar combinations and 
taking into account (6), we obtain a system of three 
transcendental equations for tf, p, and a: (7) 

tf = (f 2 x (p,a)))-^ 2 (7) 

^/oWX/xW))" 1 

fl(v,o)\v \c = -flip, a), c = cos(v ,v f ) 

The functions f x , fo, and /1 depend only on p > 
and \a\ < 1. They are obtained by taking the scalar 
product and have the form 

fl{V,v) ^ a 2 c p 2 + 2a c a v pa + a 2 (8) 

fi (Mi a ) = A* 2fl c( a C + h) + pv(ac(a n + b v ) 

+a ri (a ri + b v )) + a v (a, q + b, q ) 

f 2 (p, a) = (a c + b ( ) 2 p 2 + 2pa(a c + 6 C ) 

x(a v+ b v ) + (a v + b v ) 2 

The second and third equations in (7), combined 
with (8), give the governing system of equations for 
p and a. The unknown tf is then determined from 
the first expression. Note that a similar system was 
solved by a rather complicated separation of variables 
in [5]. That approach is also applicable to the case 
under consideration, but Newton's method was found 
to be preferable. 

IV. Optimal trajectories 

We present the main results on the construction 
of optimal trajectories and examine their basic prop- 
erties (Fig. 1, 2). It was established by contradic- 
tion that the optimal trajectory lies in the angle be- 
tween the initial velocity and the positive abscissa 
axis. Moreover, the properties of u*(t) imply that 
the angle between the position vector x(t) and the 
positive abscissa axis varies monotonically with time 
(see below). 

After the substitution v' Q = vo\vq\~ 2 , the optimal 
time becomes t'j — t/|wo| _1 . The parameters are 
transformed as follows: p' = p, a' = a, £' = £|«o|, 
and r{ = rj. When \vq\ > 1, the solution to prob- 
lem (1) can be reduced to the case < \v' Q \ < 1. If 
\v' Q \ < I «o|, then control u'(t) at the initial point cor- 
responds to the control u(t) at the terminal point. 
When I «o I = 1, the velocities and time do not vary: 
v' = vo and t'j — tf. 

The various segments of the optimal trajectories 
can be interpreted using the solution to the one- 
dimensional problem. The found features of the opti- 
mal control and the optimal time correspond the case 
of parallel vq and Vf (x, v, and u are scalar functions). 
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Figure 1: Family of optimal trajectories for \vq\ 
and various initial velocity directions. 



Figure 2: Family of optimal trajectories for a = 0.757T 
and various initial velocity magnitudes. 
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V. One-dimentional case 

The control function can be rewritten in the form 
-Pot + go 



- Pot + ?o | 



(9) 



Here, r = t/tt is normalized time. Switching occurs 
at t* = 1/A, where A > 1. In view of (9), the phase 
variables have the form 

u(t) — f?sign(— At + 1) 

_ Vf - V 



-sign(2 - A), C = A77 



\v f -v \ 

Terminal conditions (1) for x and v yield the rela- 



tions 



x(r) = V Tt f +r,^(\\T-l\(l-\T)+2\T-l) (10) 



v(t) =v Q + ri 



A(l-|Ar-l|)' 



€ [0, 1] 



t f = 



A(l 



2|u |c) 



1/2 



n 



X(v + 1) 



A 



|2 — A 

A 2 (l-v )-4A + 2u + 2 = 



t/(A-2) 



2± v/2" 



1 - wo 

According to (10), when v 
a constant vector; when vq = 



solutions to the maximum principle problem: opti- 
mal (tf = 0, u(t) = 0) and nonoptimal (tf = 4, 
u(t) = sign(— 2r + 1)). In the case of zero initial ve- 
locity, the optimal time is t°f = l + y/2. The switching 

2 

curve has the form x — =F^- for vq > — 1 and vq < —1, 
respectively. We find the set of initial velocities cor- 
responding to a terminal time tf. For 77 = ±1, we 
have the expressions 



A d 



2t/(t/T2) 



-1/2 



2t 2 f ^Atf 



Let us examine the functions ^(^j) wnen ^/ ^ 2, 
since A 1 * 1 < 1 for tf < 2. The minimum time tf = 2 
is reached at = 1. We have v^(tf) > 1 for tf > 2, 
Uo"(*/) > for 2 < ^ < t° f , and w^"(t/) < for 



tf > if. For the terminal time tf 



t°p we have 



v+(t° f ) = 0, v* =Vo($) = ~V2-2+\JlO + 8V2 

Thus, one value of the optimal time corresponds to 
one velocity when vq < or vq > v* and to two veloc- 
ities in one direction when < vq < v* (see Section 
VI). The found properties of one-dimensional motion 
are used to interpret the solution to the problem in 
the general (two-dimensional) case. 

VI. Numerical simulation 



1, the control u is Figures 1 and 2 show the families of optimal tra- 
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Figure 3: Initial control directions <f> as functions of 
a for various initial velocity magnitudes 
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between the initial velocity direction and the positive 
abscissa axis. The angle characterizing the direction 
of x(t) is a monotone function of time. 

The optimal control is analyzed for various initial 
velocities. The resulting family makes it possible to 
find the optimal control and to determine its features. 
Note the properties of the open-loop control u* (t) at 
the initial and terminal times of motion (Fig. 3, 4). 

We analyze these values as functions of the initial 
velocity direction, i.e., the angle a for fixed velocity 
magnitudes. The desired angles 4> (i.e., the directions 
of u) at the initial time are in the triangle bounded 
by the lines <j> — 2tt, <f) — a + tt, and a = (Fig. 3). 
The object accelerates (i.e., the velocity magnitude 
increases) when (f> > a + 1.5tt and slows down when 
a + n<(f><a + 1.57T. Thus, deceleration occurs 
when |t>o| G [l;+°o) and a £ [0, 7r] and when a e 
[tt/2,tt] and \vq\ > at the initial time. When a S 
[0, 7r/2) and \vo\ G [0,1), the motion switches from 
acceleration to deceleration as a increases. According 
to the solution of the one-dimensional problem, when 
|i>o| G [1, +oo), we have <fi — ► tt as a — ► and <p — ► 2tt 
as a — > tt; when |u | G [0,1), we have <j> — > 2tt as 
a — > or a — > tt (see Section V) . 

A similar diagram can be constructed for tf (Fig. 
4). The range of a is bounded by the lines ip = 
7r, ip = a, and a — 0. The transition between the 
deceleration and acceleration regimes for obtaining 
the required velocity v/ occurs on the line tp = tt/2. 
When a £ [tt/2, 7r] and \v \ > and when \v \ S 
[0,1] and a e [0, tt], the object accelerates toward 



Figure 4: Final control directions i\) as functions of a 
for various initial velocity magnitudes 
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the required terminal velocity. When |«q| G [0,1], 
we have ip — > n as a — > or a — > tt. When |uo| G 
(1, +oo), ijj — > as a — > and t/> — > tt as a — > 7T. 

Thus, for high initial velocities (vq > 1), the behav- 
ior of the control at the initial stage is similar to the 
case of low velocities (vq < 1) at the final stage of mo- 
tion. The constraints on the direction of the control 
vector imply that the velocity component along the 
ordinate axis decreases at the initial time (Fig. 3). 
Moreover, there are no optimal controls under which 
the object accelerates at the initial stage of motion 
and decelerates in the neighborhood of the terminal 
point. 

Now, we analyze the optimal time as a function of 
the initial velocity direction a (Fig. 5,6). When a = 
or 7r, the optimal time corresponds to the solution of 
the one-dimensional problem in Section V. Moreover, 
when \vq\ — 1, tf(a) — > 4 as a — > tt, i.e., the optimal 
time tends to a nonoptimal solution. In the interval 
a G [0, tt], the function tf(a) increases monotonically. 
When a G [0, tt/2] and v G [0, v*], there is a domain 
where tf < f® (Fig. 5). In this domain, the same tf 
can be reached at two different velocity magnitudes. 
Moreover, any two curves intersect when < \vq\ < 
1. 

A transition occurs at the intersection point: as 
a increases, the curve with a higher velocity corre- 
sponds to a longer time; and the curve with a lower 
velocity, to a shorter time. The properties of the 
control-switching curve (see Fig. 3) and the envelope 
of the family of curves tf(a) (Fig. 5,6) imply that, 
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Figure 5: Optimal time as a function of the initial 
velocity direction a £ [0, tt] for various initial velocity 
magnitudes 
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at the intersection point, deceleration is an optimal 
control strategy for an object with an higher velocity 
at the initial stage and acceleration is optimal for the 
control of an object with an initially lower velocity. 
The envelope is the control-switching curve at the 
initial point (Fig. 3), i.e., for fixed a £ [0,7r/2]. The 
minimum optimal time is reached when vq is orthog- 
onal to n. For fixed values a £ [tt/2, tt], the minimum 
time is reached at v = 0. 

Conclusions 

These results make it possible to draw an anal- 
ogy with the case of one-dimensional motion. The 
return of the object from the initial position to the 
same one with turning the velocity can be divided 
into two stages: motion from the initial position to 
an intermediate point X\ with at the velocity v\ and 
motion from this point to the final position. In the 
case of one-dimensional motion, acceleration and de- 
celeration are possible at each stage. These regimes 
determine the properties of the trajectories and the 
control in the two-dimensional problem. 

These features are represented as acceleration and 
deceleration regimes. In the one-dimensional case, 
the velocity magnitude can have three extrema and 
up to four basic control regimes are possible in the 
course of this motion. When a = 0, (f) = 0, and 
< I wo I < 1, we have acceleration, the reversion of 
the control's sign, deceleration, and again accelera- 
tion. When a = 0, <j) = tt, and \vq\ > 1, we have de- 



Figure 6: Optimal time as a function of the initial ve- 
locity direction a £ [0, 7r/2] for various initial velocity 
magnitudes 
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coloration (the initial velocity magnitude decreases), 
then acceleration, the reversion of the control's sign, 
and deceleration. When a — 0, = n, and \v \ = 1, 
the control is a constant vector implementing decel- 
eration and acceleration. When a = 0, we have <j> — 
for all the velocities |«o| > with |«o| 7^ 1. Decelera- 
tion is followed by acceleration, then the sign of the 
control reverses, and again deceleration is followed by 
acceleration. 

Thus, the velocity magnitude attains a minimum 
when a — 0, <p — 0, and \vq\ = 1 and a maximum and 
two minima when a = tt, <fi = 0, and |uo| > with 
\vq\ ^ 1. A maximum and a minimum occur in the 
remaining cases. 

In the two-dimensional case when \vo\ = const and 
a £ (0,7r), the control leads to switching the acceler- 
ation and deceleration regimes corresponding to the 
boundary values of a. This switch occurs at the ini- 
tial stage of the motion if \vq\ < 1 and at the final 
stage if I vol > 1 (Fig. 3,4). 
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